This blank page was inserted to presem pagination. 



QUEUEING MODELS FOR PILE MEMORY OPERATION 

by 

PETER JAMES DENNING 

B.E.E., Manhattan College 
(196*0 



SUBMITTED IN PARTIAL FULFILLMENT OF THE 

REQUIREMENTS FOR THE DEGREE OF 

MASTER OF SCIENCE 

at the 

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 



rloal Engineering, Maytfel, 



Signature of Author , 

Department of Electrical En&ineerlng, Maytfel, 1965 

Certified by . <shg<t& 4fJfA>Ws%<lS*- 

lis, Assoc. Prof. Elec, Eng. 




Accepted by ^^%^t^^^t^^ ^ ^fr^&L. 

Chairman, Departmental Committee on Graduate Students 



QUEUEING MODELS FOR PILE MEMOBY OPERATION 

by 

PETER JAMES DENNING 



Submitted to the Department of Electrical Engineering 
on May 21, 1965, in partial fulfillment of the 
requirements for the degree of Master of Science. 



ABSTRACT 



A model for the auxiliary- memory function of a 
segmented, multl -processor, time- shared computer 
system Is set up. A drum system In particular Is 
discussed, although no loss of generality Is Implied 
by limiting the discussion to drums. Particular attention 
Is given to the queue of requests waiting fro drum 
use. It Is shown that a shortest aocess time first 
queue discipline Is the most efficient, with the 
access time being defined as the time required for 
the drum to be positioned, and Is measured from the 
finish of service of the last request to the beginning 
of the data transfer for the present request. A 
detailed study of the shortest access time queue Is 
made, giving the minimum aocess time probability 
distribution, equations for the number In queue, 
and equations for the wait In the queue. Simulations 
were used to verify these equations; the results 
are discussed. Finally, a general Markov Model for 
Queues is discussed in an Appendix. 
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CHAPTER I. INTRODUCTION . 

tfith the advent of more and more complex computing 
systems it has become increasingly important to have some 
reliable means for evaluating the performance of the system. 
In the Compatible Time-Shared System (CTSS) at Project MAC (2), 
M.I.T., for example, the scheduling of users is a problem 
that is receiving much attention. Patel (1*1-) has considered 
first-come-first-served allocation of processor resources 
to users, and a multiple-level dynamic priority scheduling 
algorithm which closely models the scheduling algorithm used 
in CTSS (2). Heller (10), on the other hand, has considered 
the more general problem of a multiple-processor time-shared 
system. The purpose of the scheduling algorithm is to allocate 
the processor resources as efficiently and equitably as 
possible, minimizing processor idle time and user waiting 
time. Various schemes for scheduling have been tested at 
MAC but the one described by Patel has proved most satisfactory. 
Scherr (1?) has made a far-reaching study of CTSS-like systems, 
with particular emphasis on their Markovian aspects. 

Before the user's waiting time can be minimized it is 
necessary to minimize the processor idle time. One of the most 
inefficient operations is the swapping of information between 
the core memory and the drum or disc files. Oftentimes the 
processor must stand idle during a swap, awaiting the arrival 
in core of a block of data. One way to ease this difficulty 
is to use 
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one or more processors and let several programs occupy core 
at once. Then during the time that the swapping for one 
program is taking place, the processors oan be kept busy on 
other probrams. In this way overall processor idle time oan 
be reduced. These ideas of mult lprogrammlng and multiple 
processors are not new; it is only recently that computer 
hardware has become sufficiently sophisticated to handle the 
task effectively. 

Additional alleviation of the swapping problem can be 
effected by making drum and disc file operation as efficient 
as possible. In single -program systems efficiency of drum 
operation Is not a problem since only one program (the program) 
can demand use of the drum at a time. Clearly, in a multi- 
programmed system several programs can make simultaneous 
demands on the drum and disc facilities, making special 
organization a must to minimize the waiting time of a given 
program for its request to be serviced, and at the same time 
minimizing Idle time of the entire system. It is clear 
that in a poorly organized drum system the inefficiency of 
the drum system can seriously impair the operation of the rest 
of the computing system because continued operation often 
depends on the reading of information into ooret a program 
cannot begin to operate a segment until that segment has been 
placed in core. For instance, suppose we had at our disposal 
the means to reduce the average service time of a drum request 
by two or three milliseconds. In the two or three milliseconds 



saved much computation can be performed. 

In this paper we consider a model for a drum file 
memory system, and in particular a model for the programs 
in suoh a system. The model will describe the manner In 
which a program (or more properly, a process ) makes requests 
for file memory use. A computer simulation has been written 
for the particular model described. In Chapter 4 a pertinent 
mathematical model is given. In Chapter 5 the results obtained 
from this model are compared with the results obtained from 
the simulation. The interested reader is referred to 
Scherr (17) and to Appendix 4 for an outline of the complexity 
of even the most tractable of models, the Markov Model. 



CHAPTER 2. BACKGROUND . 

It is the purpose of this section to disouss some 
of the concepts upon which this paper is based. One of the 
problems of existing time -shared systems is that the processor 
must stand idle while the present and next user's programs 
are being swapped in or out of core. One proposed solution 
to the oroblem is to run one user's program in core, meanwhile 
swapping the next user into a remaining part of core. Then 
the processor would be switched to the next user, and the 
swapping operation would begin anew. Of course each user 
would not be arbitrarily assigned half of core, but programs 
would be matched in some opmplementary manner long ones with 
short ones. This mode of overlapped operation in a time-shared 
system is sometimes referred to as a ring (cf. Scherr (17)). 
Again, idle processor problems arise if one program should 
require all available core space. Then no simultaneous 
swapping could take place, 

A generalization of the above solution to the problem 
has been considered at length by J.B. Dennis and E Van Horn (3»6). 
It is known as segmentation . Under this scheme a user's 
program would be divided into a set of individually named 
pa^ts, called segments. The user is assumed to have segmented 
his program in the way which seems most appropriate to him. 
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Segments may be classified roughly according to the 
manner In which they may be accessed: 

(1) Read-Only. 

(2) Data. 

(3) Pure procedure. 

Some combinations of these classes are permitted. 

A pure procedure segment is a set of instructions 
which directs a process* to operate on data but not on itself. 
Thus we oould ask the oompiler to extraot all the symbols, 
variables and so forth, from a program and group them into 
one segment; procedure segments would then be allowed to 
modify and use this data. Of course certain programs, notably 
short ones, would be contained entirely in one segment, 
Read-Only data might be input data, format specifications* 
and so forth, which are not altered by the processes in a 
user's computation. Operation of a program might be in the 
following manner. Some first segment would be brought into 
core, together with all necessary data segments, which may or 
may not include read-only data. Then segments may act 
singly or in groups (if several processors are available) 
on the data. New segments are brought in as needed (when 
a reference is made to a segment not already in core) . The 
programmer may wish to declare subroutine segments , 
which might contain some of 



* 
A process is carried out by a processor under the direction 
of instructions in procedure segments. (3,^.7). 
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his often-used subroutines, and which for efficiency 1 sake 
should be kept handy In core at all times. Of course certain 
subroutines, such as printing or exponentiation subroutines, 
might be kept in special common , or library segments , being 
available for the common use of all users. In this way each 
user would not need to be given his own copy of each and 
every library routine. Figure 2.1 suggests the operation 
of the system, showing a time sequence of groups of segments 
operating on data. The time sequence may not be in the order 
in which the segments were written, and the same segment may 
appear many times In the sequence. Several processors might 
be available to work for one user, so that several segments 
might be active at once. Note that we have indicated that 
the segments are in general of various lengths. Note too 
that read-only data may not need to be present in core ( main 
memory ) but may be referenced from, say, a drum memory 
( auxiliary memory ) as needed. 

Clearly, by writing programs in segments, only a few 
segments of a given program need be in core at once, the rest 
being stored in auxiliary memory, perhaps on a high speed 
drum. A segment in core which is being used by one or more 
processes is called a working , or active segment. Segments 
kept on the drum are called dormant segments. Many users, 
or course, can have segments working simultaneously if there 
is more than one processor available. When a segment is 
working it can have one or more processes taking place in it, 
depending again on how many processors are available to work 
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Figure 2.1 . Operation of one user in a segmented system. 
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on it. Hence when talking of the computations within a 
user's set of segments, we shall apeok of a user's processes 
rather than a user's program. 

Each segment will be named in some arbitrary manner. 
When a process makes reference to a segment (by naming it 
and giring the address of some word within it) which is 
not in core, that process is temporarily suspended until 
the required segment is brought Into core. Since many users 
may have simultaneous processes it will be necessary to have 
some central oontrol over allocation, swapping, and so forth. 
The program which does this Job is called the Supervisor 
program* When a process references a segment other than the 
one in which it is taking place, the Supervisor will transfer 
control to that segment if it is in core. Otherwise that 
process must halt until the Supervisor has brought in the 
needed segment. With many processes running there will be 
a great demand for drum usage. We think of a process pausing 
a request to be made to the drums for information, rather than 
the process itself making the request. We can see that 
references to other segments are at arbitrary points in time, 
and may be to arbitrary segments, which may have arbitrary 
length. 
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If requests should be generated momentarily faster 
than they can be serviced, then the waiting requests must 
be places in a waiting line, or queue . The order in which 
requests are serviced (i.e., the order in which they leave 
the queue) is not necessarily the order In which they arrived 
at the queue. We can see three distinct parts of the data 
transmission function of the computing system: the users' 
processes , which generate requests (either to read or to 
write on the drum); a queue into which requests that have 
to wait are placed, and which has a selection rule for next 
out, called the queue discipline : and finally the drums . 

One final word must be said, concerning the trans- 
mission of data to and from the drums. It seems both desireable 
and convenient to have some standard unit of transmission 
and allocation , which we call the page. It is always 
possible to store pages consecutively on the drum (see Section 
3.4). This requires that there exist some mechanism for 
deleting unnecessary data from the drums. One possible 
mechanism, using a percentage level of drum ocoupanoy . is 
discussed in Section 3.4. It Is necessary for the Supervisor 
to maintain some level of drum occupancy, and to have a 
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deletion policy In order to keep the drum from overflowing. 

We will see under our study of queues In Seotion 3.3 
and Chapter **■ that for each request there is a certain drum 
positioning time, or access time , that must pass while the 
requested starting page comes opposite the drum's read-write 
heads. This aocess time is wasted time. We seek to 
minimize it. 

There are two general methods of handling oore allocation, 
and It is not clear which method is more desireable. One 
method is called page -turning , the other segment-turning . 
Under both methods, a set of pages will be grouped as a 
segment and given a name. Under segment-turning a whole 
segment is brought Into core and kept there at least until 
the various processes are finished with it. Under page-turning, 
one page of a segment at a time is brought in, and a new 
page is brought in only when needed. Under page -turning 
unneeded pages are deleted singly, while segment-turning 
deletes the entire set of pages belonging to a segment if 
any one of them is dele table. Page-turning seeks to minimize 
wasted oore space; segment-turning seeks to minimize overall 
processing time per user. Each method has its advantages 
and disadvantages. There is some evidence that neither is 
better (of Scherr*s Thesis, wheie It is shown that the 
scheduling and computational time quanta do not significantly 
affect system operation (17)). This paper assumes a segment- 
turning system. 
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In conclusloht when a user's process refers to 
another segment that is not present in core, it will oause 
the Supervisor to generate a request to the drums. Ordinarily 
a request will be a read request, but it might also be a write 
request if the referenced segment is one in core being declared 
in the reference as •dormant"; or it may be a delete request 
if the referenced segment is being declared "dead". The 
queue will contain the waiting requests, while the drums will 
service them. A proper deletion policy is needed. Finally 
it is clear that the unit of information transmission ought 
to be the page, but the core memory allocation question, namely 
whether to allocate in pages or in segments, is open for 
discussion. 
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CHAPTER J. THE DRUM SYSTEM . 
3.1 . Introduction . 

The system model described here consists of three 
elements: the Users' Processes, the Queue , and the Drums . 
The Users' Processes element models requests to the Drums 
to read, write, or delete. The Prooesses will make requests 
at certain intervals given by some inter-request-time probability 
distribution; they will request some quantity of data in units 
of pages, beginning at a speoified location on the drum. Several 
drums may be present, so each request will specify whioh 
drum is involved. Delete requests will be sent directly to 
the drums, while read and write requests will be entered in 
the Queue. The Queue will contain a list of which processes 
are requesting how much data from (or to) what drum, andthe 
starting location of the drum. It will act according to 
some queue discipline to decide which request is next to reach 
the drum, and will assign the request to a free channel to 
the requested drum. When a request is assigned to a channel it 
is deleted from the Queue. When a drum is notified by the 
Queue that a request is assigned to a channel it takes note 
of what program has been assigned to the channel, what the 
desired starting location and field are, and whether the 
request is a read or a write. A certain amount of time 
must elapse before the desired location has revolved into 
position; this time is the access time . Once the desired 
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starting position has come opposite the drum heads the data 
transfer begins, and ends after a certain amount of time, 
the transfer time , has elapsed. The sum of the access time 
and the transfer time is called the service time . The 
channel idle time is the time during which the channel has 
no request assigned to it. There may be some question whether 
access time should be included in channel idle time. Since 
access time directly affects a given request's wait before 
the end of its service, we h^ve included it in the service 
time. Figure 3.1 shows the system in block diagram form, 
as we have just outlined it. 

We now give a complete description of each element 
starting with the most basic, and most probabilistic, the 
Users' Processes. 



3.2 . The Users' Processes Model . 

In order for proper control of all computing facilities 
to be maintained, the individual processes in core do not 
make requests directly to the queue and drums. As discussed 
in Chapter 2, a request originates from the Supervisor, the 
program which controls allocation and proper operation of 
the system facilities. The Supervisor can prevent interaction 
between processes, providing protection against such 
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happenings as some process erroneously requesting to write 
on top of another's information. The Supervisor will contain 
the queue. 

In order to promote efficient operation, program seg- 
mentation will be used (3*6). By breaking the program into 
segments, efficient use can be made of core memory, since 
those segments of a program in which no processes are presently 
taking place should be stored on the drum and should not be 
"cluttering up" core. When a process references a segment 
not in core, the Supervisor will request that the next 
segment or segments be brought into core. Clearly while the 
next segment or segments are being read into core, any waiting 
processes are suspended; hence our first assumption: 

Assumption 1. Once a process has caused a request for 
one or more segments to be read in, it is temporarily 
syspended until its new segments are brought In. In 
particular a process will be unable to cause further 
requests until at least the time when it is resumed. 

On the other hand, during the course of computation a 

process may generate some output data in core and request 

that this data segment be stored on the drum, for example 

so that it can reuse the same core space for further data. 

Such write requests do not imply that the process must come 

to a halt, hence our second assumption: 

Assumption 2. Upon generating a write request a 
process may continue, and In particular it may cause 
further read or write requests while a write 
request is being serviced. 
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From the above discussion, we may expect that a read 

request is more probable than a write request, and so our 

third assumption: 

Assumption ^. The probability of a process causing 
a read request is not the same as that of it causing 
a write request, and in general the probability of 
a read is greater than that of a write. 

In order to simplify space allocation on the drums, 

the surface of the drum will be divided into blocks, or 

pages, consisting of some fixed number of words. Thus the 

number of words per page is fixed, and 

Assumption ^. The unit of information transmission 
and storage will be the page. 

We have no reason to assume that the number of pages in an 

arbitrary segment is fixed; in fact all we can say is that 

long segments (those with many pages) will be unlikely as will 

extremely short segments (for example one or two pages). 

The number of segments in a block of n pages ia a random 

variable, and in particular the probability of finding 

exactly n pages in s segments may be given by a discrete 

Poisson Distribution: 

P(s , n) „ l££l£ e -n/N W.2-" (1) 

where the mean number of pages per segment is N. 

Consider this problem: if a process should reference 
more than one segment not in core, so as in initiate the 
read-in of several segments, should the Supervisor ask for 
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the several segments In a single request, or should It 

make separate requests, one for each segment? We are assuming 

that it is always possible to store the pages of a given 

segment sequentially on the drum, that is thqt we can always 

read or write a segment without interrupting the transmission 

between start and finish. How this is done is considered in 

some detail in Section 3.*f. For three reasons we argue that 

in the event of need of several segments contemporaneously 

there should be a separate request made, one for esch segment. 

First, since consecutive segments may not be all written 

at once, but may have been written at widely spaced intervals, 

and independently or each other, it is unreasonable to assume 

that segments will always be stored consecutively; although 

this could be done by the method of Section 3.^. Second, 

there is no assurance that the requested segments will all 

be on the same drum, or that the request will even be for 

consecutive segments. Finally some queue disciplines 

discriminate against long requests, servicing those requiring 

the shorted service times first (Section 3.3) » asking for 

several segments in one request could well result in an 

Inordinately long wait for service under such a queue discipline. 

We now make our fifth and sixth assumptions. 

Assumption jj. Each request will be for one segment, 
but at a request time a process may cause several 
requests. The probability that s segments will be 
requested will be exponential, that is 

P{s) » e" s s=l,2,... (2) 
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Furthermore at request time there is no reason for 
all the requests to be either all read or all write; 
they may he mixed, A read request, or course, will 
cause suspension of the process. 

Assumption 6. The number of pages in the single 
segment of each request will have probability of 
being n pages 

P(n) = * e- n/ * n*0,l,2,... (3) 

where N Is the mean. 
When a segment is active, that is, when processes are 
referencing it, the probability that the next requests occur 
at each successive time instant are independent so that we 
expect the arrival times or requests to be Poisson 
Distributed. A request is unlikely to be made immediately 
after resumption of a process from the last request, and it 
is unlikely to be made an extremely long time after the 
resumption of a process. The probability of exactly k 
requests in a time interval t is 

P(*,t> = *^£ e- at t > W 

where a is the average number of arrivals per unit time. 

We have then 

Assumption 2« The inter-request times are taken from 
the following distribution* 

P(t) « ae" at t > (5) 



*See page 60. 
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Something must be said about the starting position of 

the drum a particular request will seek. We have no information 

to allow us to assume anything other than that all drum 

positions are equally likely to be requested* 

Assumption 8. At a particular request time all drum 
positions are equally likely to be selected j that is, 
the density of angular positions requested will be 

P(0) - ^- < < 2it (6) 

Finally something must be said about whloh drum is to 

be requested, in the event that there are several drums in 

the system. When a process is making requests for several 

segments there is no reason to assume that all the requested 

segments will be on the same drum. Hence we are willing to 

say that each of the D drums is equally likely to be requested i 

Assumption £. Each request is equally likely to 
be for any of the drums in the system. 

Assumptions 3»5t7»8» *n& 9 arc illustrated in Figures 3.2 

to 3.7, 

Based on the discussion above, we are in a position to 

construct a model for the request aotivity of a given 

prooess. This model is shown in Figure 6 9 + 

*A Note on Notation: A fork is a point at whloh one process 
splits into two processes, which follow their own paths. A 
,1oln is Just the opposite, where two processes become one; each 
time the Join Is entered the operations In the box of the 
flow chart are oarried out. An arrow doing this m I 
is a termination of a process. A note in brackets gives the ■ 
condition, permitting a process to emerge from the corres- 
ponding box. A function written with an argument (.) denotes 
a probability funotlon for a set of identically distributed 
random variables. 
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Figure 3.2 . Relative frequencies of read and 
write requests. 



nseg(.) 
\ 



\ 



*s. 



T^T-T-r- 



Figure 3.3 . Relative probabilities of number of 
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Figure 3.5 . Relative probability of inter- 
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drum position. 
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Figure 3.7 . Relative probability of requested drum. 
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Send n to 
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Send npg(.),drmpos(.), 
drm( . ) , id to Queue . 



[request 1 
finished] 



[teis] 
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Send drmpos(,), n, 
wr to Queue* 



[request | 
finishedj 



trq(.). 



Walt until time - time + w 



•The delete meohanism 
is discussed in Seotion 
3.*. 

nseg(.) ■ no. of segments requested 
rdwr(.) » rd-wr distribution 
npg(.) * no, pages requested 
drm(.) - requested drum 
drmpos(.) ■ requested drum position 
n m no* pages this request 
ns » temporary segment count 



Figure 3.8 . The Processes Model. 
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The reader may be asking what justification there is 
for assuming the particular probability distributions that have 
been chosen; in particular why we have chosen Pols son 
distributions as opposed to other distributions. It will be 
noted that these choioes are completely arbitrary , and cannot 
be properly determined until some statistics are available 
about the system we are discussing* It is felt that the 
assumptions that have been made are reasonable. 



3.3 , The Queue . 

The model of the queue is more straightforward and 
deterministic than the model of the processes. When a request 
is received from a prooess it is entered in a list within the 
Queue Element, Eaoh entry in the list contains the following 
information t an identification number of the prooess requesting, 
the number of pages involved in the transmission, the desired 
starting location on the drum, the Identification number of 
the desired drum, and whether the request is a read or a write. 
The number of pages Is an Important piece of Information sinoo 
it can be used to determine when the transmission is ended. 
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A possible structure for the Queue* s list, which we 
will refer to simply as the queue . is shown in Figure 3.9. 
In this list two pointers are used, one to Indicate the lower 
limit of the number in the queue (the shaded region), the other 
to indicate the upper limit. Both pointers are periodically 
incremented and are modulo capacity of queue . The lower pointer 
is moved down one position each time a new entry is made, and 
the upper pointer is moved down one position each time a 
request leaves the queue. If the next out is not the least 
recent entry, then all items above are moved down one position 
to fill the gap. The shaded area represents the number in 
the queue, frequently referred to as the length of the queue. 

There is a Boolean signal received from each of the 
drums indicating whether or not that drum is busy (all 
ohannels to it in use). Whenever all channels to a drum 
are busy, any requests arriving for that drum must wait in 
line, and a waiting line, or queue, is formed. If requests 
arrive too much faster than they can be serviced, the length 
of queue could become equal to its capacity and any further 
requests will be lost . Such a development is disastrous, 
since it would render a process useless. Hence the average 
arrival rate must not exceed the average service rate , where 
the rates are defined to be the reciprocals of the 
average interarrival and service times, respectively. 
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Figure 3.9 . A structure for the queue stack. 
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When the Queue Is aware that a drum Is not busy, it 
looks down the list to determine which if any requests want 
the free drum. It then chooses one of them according to the 
queue discipline, assigns it to some free channel to that 
drum, the deletes the entry from the list* 

The queue discipline is simply the rule for selection 
of next out . We consider four queue disciplines applicable to 
our situation: 

(1) First come, first served. 

(2) Shortest access time first. 

(3) Shortest Job first. 
(k) Mixed policy. 

(1) First come first served . 

This is the "fair" or "equitable" queue discipline, 
where requests are serviced in the order of their arrival, 
and is the case when the "next out" of Figure 3.9 is the 
"latest entry". It does not result in the most efficient 
operation. It is analogous to the normal situation encountered 
in a post office, when one wishing to buy a single stamp 
must wait behind a person with several packages. Certainly 
the waiting time is greatly increased because of ill fortune, 
whereas the person ahead would not be significantly delayed 
to give way. Since a process is equally likely to ask for 
any drum position, and since the present drum position is 
likely to be anything, with the first come first served queue 
discipline the average access time is half the drum revolution 
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time* Lot us represent the time a request Is In the service 

system (the time from when a process makes a request until 

the time service is completed) by T s . Let the drum revolution 

time be T. Let the average transfer time be T t # And let 

the average wait in the queue be W - Then, for first oome first 

served, 

T. - tf + TV + T/2 (1) 

o q ■ * 

(2) Shortest Access Time First . 

Under this queue discipline the next out is selected 

according to following rules 

Choose the one for which the rotational positioning 
delay until the desired starting address is minimum* 

Now if more than one request for a given drum is in the queue, 

on the average the access time will be less than half the 

drum revolution time; this is so since with more in the 

queue the probability that there is a request for the present 

drum position is greater than for a queue of length one. It 

will be shown later that the minimum access time is roughly 

inversely proportional fo the length of the queue. Hence 

for this queue discipline 

T m V* + T. + T/n (2) 

a q TJ 

where n is the average number in the queue, and Wq is 
not the same number icmlly as for the first oome first served 
queue with the same n. Observe that the shortest access time 
queue is a dynamic priority queue, one for which the priorities 
of requests are changing randomly. 
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(3) Shortest Job First . 

Under this queue discipline the following rule Is 
used to select the next outt 

Seleot the request for which the servloe time Is 
a minimum. The service time Is the sum of the access 
time and the transfer time. 
A little thought should convince the reader that under this 
queue discipline the access time is not minimized, but yet 
It will In general be less than T/2 for queues of length two 
or more. Hence 



T„ * W„ ♦ T* + T* 
s q x 



(3) 



where T/n < *• < T/2 , and W Is not the same numerically 
as for either a first oome first served or shortest access 
queue having the same length n. 
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(4) Mixed Policy Queues . 

It will be noted that the shortest access time queue 
and the shortest Job first queue are queues in which a continuous 
number of priorities exist. Suppose we beoome concerned 
about requests which might have to wait an Inordinately 
long time, perhaps because of ill fate, perhaps because its 
job time is long. This could be a real problem in the shortest 
.1ob first case (what of the longest Job of all?). It ought 
not to be too much of a problem in the shortest aeoess time oase, 
since eaoh time a request is to leave the queue, it has an 
equal chance among all the others of being chosen. This Is 
only partially true for the Shortest Job First Case, where 
the Job time has a random component , the access time; and if 
the transfer time component be very long, then the Job time 
depends almost entirely on the transfer time. Notice that for 
very short transfer times, the shortest job queue will approach 
in operation the shortest aocess queue. One way of circum- 
venting the problem of some request waiting inordinately long 
is to Introduce a skip limit into our model. Each time a request 
is skipped over as next out, a counter associated with that 
request is incremented. If this skip count ever exceeds the 
skip limit then this request is next. What we have done 
in essence Is to add a first come first served component to 
the queue. As the skip limit Is lowered a shortest aocess time 
or shortest job queue behaves more and more like a fiarst come 
first served queue. In fact a queue with the skip limit set 
to zero is just a first come first served queue* 
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We will Include no more discussion on mixed policy 
queues since the problem is in general complex and unsolved. 
We will, however, mention the skip limit once again in Chapter 5 
under the discussion of simulation results. Finally, another 
mixed polioy queue is disoussed in Appendix 3. For further 
discussions on the matter, the reader is referred to the 
literature (1,9,13,15.16). 

(5) A Comparison of Queue Disciplines . 

In Appendix 1 we have related the mean number in the 

service system, which includes those being serviced and those 

in the line, to the mean and variance of the service time 

distribution. We will denote the number in the system by L. 

The random variable of the service time, t„, is the access 

time, t , plus the transfer time, t,.. The service distribution 
a v 

can be found from a convolution of the access distribution with 
the transfer time distribution. We will show later that 
both of these can be found, hence the service distribution 
can be fou'-d. In particular, the mean service time, T o , is 

a 

T « T + T + (4) 

SSI' 

2 

And the varianoe of the service t ime , a * , is 

's ' < * "a <5> 

since we are assuming independence of t ft and t t » Let us 

2 

denote the function relating L, T , and a. by 

8 o 

L * P( V a s ) * F( V T t» °a 2+ °t 2 > (6) 



37 



The function F from equation (6) id such that a decrease 

2 

in either or both of T and a will result in a reduction 

in L. The transfer time distribution will remain the same 
for all queue disciplines sinoe It is a function of the number 
of pages per segment, which is fixed before hand, and is 
assumed to be identical for all processes. 

Let the total number of processes in all be N. Then 
N ■ w ♦ L (7) 

where w is the number of working processes. The efficiency 
of a system can be measured orudely by the number of processors 
working, and is 

efficiency - | » N g V - 1 - Jf (8) 

To maximize the efficiency, L must be minimized. Thus the 
optimum queue discipline is the one for which is minimized. 
Notice further that the number of processors that can be 
kept working is just the number of working prooessrsi 

number of busy processors ■ w b Nfl - ^) (9) 
For the simplest system, the single -processor system, w must 
never be less than one if the processor is to be continuously 
busy. 

On the average the transfer time is the same for all 
queue disciplines (because it relates direotly to the number 
of pages in a segment). On the average the aooess time is 
explicitly minimized only by the shortest access time queue. 
Therefore the service time for the shortest aooess queue will, 
on the average, be a minimum, compared to other queues. 



?. ^nr^ ? ? & ^ v -^./ w ' ^ 



38 



We see that the shortest access time queue minimizes the 
service time, while the queue which bears the * shortest Job 
first" does not minimize the average servloe time. The 
apparent oontrad lotion is resolved when we realize that the 
shortest access queue chooses the shortest Job first on the 
average . while the "shortest Job first"* queue selects the 
shortest Instantaneous Job. Nevertheless equation (6) tells 
us that the shortest access queue must have minimal L associated 
with It, and is therefore the most efficient*. In fact, 
any queue which does not minimize the access time must be 
less effioient than a shortest access time queue, when 
efficiency is defined by equation (8). Chapter Ik is devoted 
to a detailed study of this queue. 

Based on the above discussion, we give the Model of the 
Queue in Figure 3.10. 



♦This conclusion is verified by simulation. See Chapter 5. 
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npg « no. pages requested 
rd « read request 
wr * write request 
drm ■ the drum desired 
drmpos « the starting position 
drmbsyli) - Indicates ith. drum busy 
nq« number in queue 



Figure 3.10 . The Queue Model. 
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?.4 . The Drums . 

Since the method of distributing pages on the 
drum is of considerable Importance, we will discuss it first. 
Consider Figure 3.11. The drum, we suppose, Is divided into 
seotors as viewed from a cross- sect ion, where the number of 
sectors is an integer. The number of words per page is Just the 
number of words that oan be written around the circumference 
of the drum divided by the number of sectors. The drum Is 
divided into rings, and the width of one suoh ring is a field . 
A field Is one word in width, and a word Is typically 36 binary 
bits. Each field is subdivided into a number of tracks t each 
of which is associated with one read-write head . The same 
head Is used for reading and for writing* a read amplifier 
or write amplifier is connected as needed. The operation a' 
head Is presently performing is called Its status , and there 
is a delay associated with switching between read and write 
status. This selection delay is about the same time as for 
three or four words to pass beneath the head, so ordinarily 
the first few words on a page will be left blank to allow 
for this delay. 

It was stated previously that it is possible to write 
a segment of N pages on the drum contiguously. We indicate 
how this oan be done. The question Is: suppose some of the 
seotors in a field are used, how oan a string of N pages be 
written consecutively, especially if a page would have 
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Shaded squares are a consecutive 
string of pages, each with a 
pointer to the next. 




— Tracks per field 



<3^ 



Fields 



Figure 3.11 . Organization of the drum. 
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to be written on a used sector? The answer is that we 
do not attempt to write the pages in the same field. We 
require only that during a write operation there be at least 
one free field per sector. First of all, suppose that each 
sector was allowed to have all but C of its fields in use, 
where C is the number of channels to the drum, and where 
any channel can access any field. Suppose further that 
whenever fewer than C fields were free on a given seotor a 
deletion occurred immediately. Then the drum could handle 
C simultaneous write requests beoause a free field can always 
be found. In reality a delate might not occur when necessary, 
and also there is the possibility that a segment is longer 
than the number of sectors, which implies that more than one 
of its pages would be written on the same sector. It would 
be better to set a drum occupanoy level , which is the ratio 
of allowed fields per sector to the actual number of existing 
fields per seotor: 

occupanoy level < — p — (1) 

where P is the number of fields per sector, C is the number 
of channels to the drum. Then whenever the occupancy level 
is exceeded, some sort of emergenoy condition would be set 
up, and any unnecessary segments would be removed from the 
drum (they would be deleted, or they might be moved to a 
lower level of storage, for example a disc file). In such 
a case the oooupanoy level would have to be less than the 
upper bound set by the equality sign in <1) to allow for 
statistical fluctuations. Simulation has shown that 
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ocoupaney levels In exoess of 93% are possible with a proper 
deletion policy, for typleal parameters. See Chapter 5* 

When a write request comes, the pages are written 
on the drum on the first free field on each sector, and a pointer 
is left, to direct a read operation to the next field of a 
oonseoutive string of pages. These pointers are indicated by 
arrows in Figure 3.11. Thus it is possible to have a string 
of consecutive pages written (and read) without interruption. 
We require rapid inter-field switching, a feature available 
on high-speed drums. It is to be noted that if the drum is 
be be operated this way it will have to maintain its own 
"Field Usage Table* similar in principle to the "Traok Usage 
Table* used in CTSS with the disc(2). When a write request 
arrives, this table is consulted to l&oate the nearest 
free field on the given sector. 

As long as the Supervisor's delete Ion policy sees to 
it there are always sufficient free fields on each seotor, the 
drum operation is straightfot^ward. The delete mechanism shown 
in Figure 3.12 determines how many pages are to be deleted from 
the drum; it does this whenever the desired occupancy level 
is exceeded. We may model the behavior of a deletions by 
picking a random drum address and deleting one page from each 
sector until N pages are deleted. A "deletion" may be to 
remove the of fending pages to a lower level of storage, or 
it may be to obliterate the pages entirely. 
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Onoe a channel Is assigned, the drum observes whether 
the request is a read or a write, and switches the heads associated 
with that field to that status, as soon as the starting sector 
is apposite the heads. Hote that the set of heads associated 
with a giTen field may be in use by different requests from 
sector to sector. When the starting page is in position, the 
data transfer begins, allowing tine for the switching delay 
at the top of each page. Three or four words left blank on 
a page is sufficient time for this. At the bottom of each page 
is an End of Page mark, with a pointer to to the field containing 
the next page, which initiates switching to that field; of 
course the heads there are put into the proper status. 
Finally, at the end of the last page of the segment, an End 
of Pile mark will be encountered, and the channel is freed 
for the next request. 

The ideas for the drum model are embodied in Figure 3-13. 
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Figure 3.12 , The delete ■eohanisa. 
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Figure 3.13 . The Drums Model , 



46 



CHAPTER 4. THE SHORTEST ACCESS TIME QUEUE . 
4.1. Introduction . 

In Section 3.3 it was shown that the shortest access 
time queue discipline is the most efficient; it is the purpose 
of Sections 4.2 and 4.3 to analyse this queue as best can 
be done. The shortest access time queue is a special form of 
the shortest Job first queue. Solutions have been obtained 
for shortest Job first queues with the input rates independent 
of the queue length. No solutions have been obtained for a 
shortest Job queue in which the input rate is dependent on 
the queue length, that is, when there is only a finite number 
of requestors. In the next two sections we do not attempt 
to solve for the probability densities of queue length, 
waiting times, and service times; rather we talk only of the 
averages, which become time Independent at equilibrium, when 
the input rate to the system is the same as the output rate 
from the system. In Section 4.2 we derive a probability 
density function for the minimum access time as a function 
of the mean number in the queue; then in Section 4.3 we combine 
these results with the results of Appendix 1 to obtain some 
approximate expressions for the number in the queue, and for 
the waiting time in the queue. 
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4.2. The Minimum Access Time Distribution . 

The access time Is defined as the time from the exit 

of a request from the queue until the requested starting 

sector has come opposite the read-write heads. It Is simply 

a positioning: time. We hare shown in Section 3.3.5 that a 

queue discipline which minimizes the access time Is the most 

efficient; we wish to derive the access time distribution 

In this section, and in a later section we will determine the 

waiting times in queue using the Pollaczek-Khintohine Formula 

(Appendix 1). We define the xllowing quantities, given 

that n are In the queue: 

R. * requested starting sector of the drum for the 1 
request in the queue. 

D(t) * The angular drum position at time t. 

A,(t) » required access time at time t for the 1 
1 request given that the present drum position 
is D(t). 

a - random variable of minimum access time, which 

takes on values a . 
o 

T s drum revolution time. 
The model of the shortest access time queue discipline shown 
in figure 4.2.1 best Illustrates what is going on. 

The comparators compare the requested starting sector 
with the present drum position and give as an output the 
required access time. The Min(.) box selects the minimum of 
its inputs and sets its output to this value. To simplify 
the derivation we will assume that the Mln{.) box normalizes 
its output with respect to the drum revolution time T, so 
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D(t) 
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Figure 4.2.1 . Operation of shortest access queue. 

that a Is a fraction "between and 1. We have 

a - J.HlnCA 1 (t) f A 2 (1b) f ... f l h (1?>] (1) 
where < a < 1 # We are interested in the probability 
density of a as a function of n, the number in the queue. 

It was stated in Seotion 3.1 that the probability density 
of the R. is uniform, that is, all drum sectors are equally 
lively to be requested. Further we are assuming random 
segment lengths. If segment lengths and starting positions 
are random, the present drum position, which is the drum 
position .just at the finish of the last request (so that; 
the next request is about to be assigned), is random, and by 
symmetry and the independence of requests, we may assume that 
it is uniformly distributed.* 



♦This is not true in the case of short segments because the 
drum will have rotated only a short distance. This matter 
is discussed further in Seotion 4.2, page 63. 
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But If D(t) and B^ for each 1 are uniform, then A^t) must 
be uniform for each ij that is, the i th requests access time 
is equally likely to be any fraction of a drum revolution. 
Figure *K2.2 shows the density function for A 1 (t) t which has 
been normalized with respect to the drum revolution time, T. 



P A (t/T) 
A i 




» 



t 
f 



Figure *K2.2 . Access time for 1 request. 
Now, the probability that a > a Q is just 

P[a > a Q ] - P£A 1 (tf)>a o V^^o 1 *' 

But the H, are independent, so that the AjCt/T) are also 
Independent, and 

PCa > a Q ] « P[A 1 (f)>a o ]...Pi:A n ( t,)>a o3 *' «.£ 
But PLA l (t t )>a o ] is just the shaded portion of Figure lt.fe',2, 
and is simply (1 - a Q ). Then 



*> > *J 



(1 - a/ 



(2) 



equivalent ly p£a < a ] » 1 - (1 - a )' 



and 



W - afc * a * v> 



,n-l 



P a (a Q ) - n(l - a ) n ' A (3) 

Equation (3) is the probability density of a, given that n 
are in the queue. Figure *K2.3 shows P ft <a ) for a few values 
of n. 
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Figure 4.2.3 . Shortest Access Time Distribution. 

By the definition of conditional probability: 

P aB (a o' n) = P a/*<»</ n > P N< n > 

where N is the random variable of the number in the queue, 
which takes on values n. Then 

P aN<V n) " n < X " a o' n-1 V n) 
The mean access time, a , is 



^^O 1 ^ 1 "'^ 1 ^ 1 *"- 
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Integration by parts over a leads to 



n«l n+1 N 

The seoond moment, a , is 

7~ = E l\*nil - a ) n - 1 P N (n)da 
n«l 

Integration by parts over a. Q leads to 

n=l 
The variance of the access time distribution Is then 

°a " ? - * Z = J.TsdW P N <n) 
n«i 

eo 

Note that we h*ve not specified P N (n) , the distribution of 
the number In queue. Note too that it is not the same as the 
time distribution of n. It is the distribution of number 
In queue as seen by the departing requests — we need Pjf(n) 
taken over instants when the next request is extracted from 
the queue, which does not happen at uniform intervals. It 
is a reasonable assumption* that P N (n) is a normal distribution. 
This is only an approximation, since the normal distribution 
would allow for some probability of negative n, which is 
physically meaningless; this must be used carefully when 



♦Based on the Central Limit Theorem. 
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n is small enough so that the portion of the normal ourve 

extending below n*0 Is appreciable, espeolally when the variance 

2 — 

of P N (n), a n , Is large, so that o^^N. p j|( n ) *» 



V n) " 7="" «PC-*< n * S > 2 /°n ^ (8) 
V2no n 

Putting (8) Into (5), 

""ia^^T 6 ^-^-^ " 3 (9> 

And putting (8) Into (7), 

"• " Ji ik" WW "*-*<» - 5>2/0 » 3 < 10 > 

n 
Equations (9) and (1) cannot be reduced further, even If 
the summations are taken to be Integrations over the Infinite 

interval. These equations do, however, yield readily to a 

— 2 

computer, and families of curves for a and o^ have been 

assembled and are shorn in Figure 4.2.4 and 4.2.5. The 

axes are normalized so that, given the drum revolution time 

T, values of aocess time oan be found. 

We wish to note the limiting forms of equations (9) 
and (10). These occur for n » 1, and for a « n. Figures 
4.2.4 and 4.2.5 show that for n > 8 we may ignore the effects 
of c n , for o n of interest (see Section 5.2), with only a 
small error. Now if a n Is very small compared to n then 
the normal curve approaches a unit impulse in the limit, 
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and the summations of equations (9) and (10) reduce to a 
single value, taken at n « n, Thus for a n « n: 

a = rjr at n * n (11) 

°a c n+T ^ n/(n+2) at n - n (12) 

It is to be noted that (11) and (12) are evaluated at 
n * n, and that the approximation is very good if the conditions 
are a*t; this is evidenced in Figures k.Z.h and ^.2.5, where 
equations (11) and (12) have been drawn. One of the prime 
assumptions of this derivation is that the drum positions 
at successive request-granting times are independent. If the 
drum positions at successive request -granting times are not 
independent, then the access distribution is in error. This 
is the case if the average length of requests is small compared 
to a drum revolution. See the discussion on page 63. 

In Appendix 2 one further result of interest is 
obtained. The form of the probability density for the waiting 
time in queue is derived and Is shown to be exponential. This 
Is In excellent agreement with the simulation results discussed 
In Section 5.2. 
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fr.3 . Examination of Shortest Access Time Queue . 

The solution to a queue Ing problem In which the pdlioy 
Is based on a continuous number of priorities, such as the 
shortest ,}ob first and shortest access time queues, Is not 
easily obtainable. In particular no solution has yet been 
obtained for a finite requesting population, under a shortest- 
job-first type queue discipline, since the arrival rate of 
requests tends to depend heavily on the size of the queue 
and the service time. As the queue become full, the rate 
of arrival of requests tends to slacken because there are fewer 
members of the requesting population outside of the servloe 
system. In this section we will derive a set of approximate 
equations for the number in the queue as a function of inputs 
and service parameters, and indicate an iterative procedure y 
for solving them. 

We suppose that the queue is in statistical equilibrium, 
that is, the system has been in operation sufficiently long 
that the time average of number In the system is constant. 
We shall use the following notation: 

n ■ the mean number in the queue. 

W = mean wait on a request in the queue. 

Q 

T « drum revolution time. 

T ■ mean service time, 
s 
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T. = mean transfer time. 
T a « mean access time, 

a = mean arrival rate. 

b = mean service rate. 

A = mean interval till the next request from one 
process, from the time it resumes. 

s «= mean number of pages per segment. 

m a number of sectors around the drum. 

N - population size, i.e., the total number In 

the queue, plus the number in service, plus the 
number generating requests. 

r = traffic intensity ratio, i.e., the average 
number of busy channels. 

In the previous section we saw that due to independence 

of requests, random segment lengths, andrandom present drum 

position, that at each request-granting time, each request 

was equally likely to be next out. We have a series of 

Bernoulli trials, then, with a probability of -J of a particular 

request being picked at a given trial, and probability 

(1 - ^) of being overlooked, where k is the number in the 

queue at the time of the trial. On the average we can say 

that the probability of being chosen on any trial is approximately 

— , where n is the average number In the queue. Therefore 

the probability of being chosen on the k th reque sting -gran ting 

time after a request enters the queue is, on the average, 

given by a geometric distribution, which we denote by P(k). 



Then 
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p(k) - (i - i> k " 1 (J) u> 



We wish to determine the waiting time of a request in the 
queue. The z-transform of equation (l), which we denote 
by p£<2), Is 

p£(z) - jhi - !)"(!) z* 
which can he reduced to the closed form 

Pk<«> - » - (n - l)z < 2 > 

The mean number of trials before the given request is next 
out is 

And the variance is 

o k 2 «n(n-l) CO 

The average wait is Just the average number of service 
intervals that must pass while a request is in the queue. 
If a request arrives Just before a service begins it must 
wait only (n - 1) intervals; if it arrives Just after a 
service begins, it must wait n intervals, as given by 
equation (3). On the average, then, it must wait (n - £) 
service intervals. The wait in the queue is therefore 

W q = (n - *) T s (5) 
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We suppose that eaoh Interval Is of duration T . where T 

8 S 

is the mean service time, and 

and where T is the mean access time. For n in the queue, 
we can use equation (11) of Section fc.2, so that 

It is reoalled that T ■ T/(n*l) is an approximation, becoming 

more accurate with increasing n. The mean transfer time is 

the time to service the mean number of pages per segment, 

which is 

T t - T £ <7> 

where s *= mean number of rages per segment, 

m « number of sectors around the drum. 
By putting (7) into (6) we obtain 

T 8 -*<;&♦ = > < 8 > 

We have noted that the shortest access time queue is a 

random output queue, so that we can use the result of 

Appendix 1, which says that 

Mean number in the service system ^ mean number in queue 
mean service rate mean arrival rate 

(9) 
where the mean service rate is b = l/(mean service time), 

and the mean arrival rate is a * 1/ (mean arrival time).* 

_ 

We are assuming as in Section 3.2 that arrivals are Polsson, 
and that segment lengths are Pols son distributed. That is, 
the probability of exactly k requests in a time t is 

P(k,t) » (at)* e" at t * 

(continued) 
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Equation (9) is exact only when the arrival and service rates 
are independent of the number in the queue, which is not 
the case in the finite population system we are discussing. 
We can use equation (9) because there must exist an equivalent 
infinite-population system whose equilibrium arrival rate is 
the same as the arrival rate to the shortest access system 
when it is in equilibrium. We proceed to substitute the 
appropriate quantities into {9) and then solve for n, the 
mean number in the at equilibrium. 

Pirst note that r, the traffic intensity ratio , is 



where a is the arrival rate at equilibrium. The probability 
of finding exactly k segments in a block of n pages is 

P(k t n) - &$£ e- n/s n-1,2,3,... 

where s is the mean number of pages per segment. The 
waiting time between polsson arrivals is 

P(t)dt * P(no arrivals during time interval t) 

X P(one arrival in time interval dt) 
*k 



*#-.-*]«(•>(**> 



so that P(t) m ae" at t > 0. 
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also the time average number of busy channels: 

r m a mean arrival rate 
be" = mean service rate 

where o * number of channels, 

b = mean service rate 

a ■ mean arrival rate. 
The mean number In the service system Is just (n+r). Now 
If the Interarrlval time for one working process Is A, then 
at equilibrium It must be, for all working processes, 

<S - n - r> " J < 10 > 

Because (N-n-r) are not In the service system, and are therefore 
making requests. We can now fill In (9) to get: 



*(% - (H - £ - rl <«». 



A 

We define a quantity R to be 

T 
r B -S * JL ( _L- + £ ) (12) 

a oA cA K n+1 * m ' KX£t 

Note that R is an intensity ratio for one process, and 

T 8 - a A c (13) 

Then the intensity ratio r is 

T 
r » ^ * / — * R (N - n - r) 

W-rihrrT iM 

Solving (14) for r, we find 



r » (N - n) y~Tr (1 *> 
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After putting (15) into (11) and performing the appropriate 
algebraic manipulation, we find 

n * g (16) 

The form of equatlon(l6) has been chosen because it is 
solvable by a process known as relaxation (or iteration), 
in which a guess at n is put into the right side of <l6), keeping 
in mind that R « H(n) . A new value of n is obtained. This 
new value of n is placed into the right side of (16) as 
before, yielding yet another value of n. This process is 
continued until the new value of n is the same as the previous 
value. It was found that (16) converges rapidly, within 
five cycles. 

Collecting the results, 



N 

1 — (l?a) 



V - (n - *) R.A o (17b) 

T- R A o (17c) 

s 

A simulation has been carried out to test equations (17). 
The value of n was found to be within 1% of the simulated values* 
the value of W was within 3<« These answers were considered 
satisfactory in view of the approximate nature of the 
derivation. 

Due to the nature of this problem we are unable to 
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say anything about the standard deviation of our results. 
Simulation has shown that the standard deviation of the number 
in queue is less than 1.0, while the standard deviation of 
the waiting time was in general somewhat larger than the 
m*an. In particular, one simulation reported a maximum 
wait of about ten times the mean. 

As a final note we want to point out that one of the 
basic assumptions of this seotion and the previous section 
is that the drum position is random at each request-granting 
time. This means that the drum positions at successive request- 
granting times are independent. But this need not be the 
case. Suppose for instance that the transfer time, T t , is 
a small fraotion of the drum revolution time,T (for example, 
suppose the average transfer time, T t m 0.1T). Clearly, 
if this Is the case, the drum positions at successive request- 
granting times are dependent, because we can say that the 
probability of the drum being only 0.1T away is much greater 
than being, say, 0.5T away. This is obviously contradictory to 
the assumption of independent drum positions at successive request- 
granting times. Consequently we expect the access time to be 
below the predicted values, since the probability of finding a 
request wanting the present drum position is greater than 
If the drum position is random. If the access time were 
smaller than the predicted values, then both W and n would 
be smaller than predicted, T would be smaller, and the 
system operation should be more efficient. Simulation has 
shown that this is the case, that effiolenoy is increased 
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when segments are short. In particular, since (N-n-r) 
processes are working, then the fraction of processes that 
are working Is 

(» - n - r) _N - n (18) 

* N L m 11(1 ♦ ft) 

If n substantially decreases, by (IB) the efficiency sub- 
stantially Increases, The greater the efficiency, the 
greater the number of processors that can be kept busy. It 
Is to be noted that when T t Is of the same order of magnitude 
as T, or larger, then the drum positions at successive 
Intervals become independent, and the analysis of this 
section Is valid. 
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CHAPTEB 3. THE SIMULATION RESULTS . CONCLUSIONS . 
5.1. Introduction , 

In order to observe the operation of the model of 
the entire drum system, which Is discussed In Chapter 3t 
It was decided to simulate the system. Project MAC computation 
facilities were used; the simulation was written In SIM, a 
new simulation language conceived and Implemented by A.L. Soherr 
at Project MAC (1?). SIM Is an augmented version of the 
MAS programming language, adding several new statements to 
those already existing In MAD. It has the powerful advantage 
that the logical flow of the simulation Is the same as the 
logical flow of the actual system. Each element of the system 
(see Figure 3*1) » namely the processes, the queue, and the 
drums, Is specified In the simulation as an Element (which 
Is translated Into a MAD external function by a SIM pre- 
compiler). The lnter-elemental signals shown In Figure 3.1 
are Implemented In SIM by system variables , which allow a 
signal to be transmitted from one element to another. A 
main program oalled SIMSYS coordinates the activity of the 
elements. 

Three simulations were run. One was a simulation of 
the entire drum system dlsoussed In Chapter 3« Another was 
a simulation of the shortest access time queue dlsoussed 
in Chapter 4. Section 5.2 discusses the drum simulation, and 
Section 5.3 discusses the queue simulation. A third simul- 
ation was used to develop Appendix 2, and is discussed there. 
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5.2. The Drum Simulation . 

The three elements of the simulation were the Users* 
Processes, the Queue, and the Drums. These elements and the 
signals that were passed among them are shown in Figure 3.1, 
The logical flow of each element is the same as shown in 
the flow graphs of Figures 3.8A, 3.8B, 3.10, and 3.12, where 
the models of the Processes, the Queue, and the Drums are 
depicted. 

CTSS has available a random number generator, which 
is useful in the simulation of the Processes to generate 
the probability distributions discussed in Section 3.2. 
The random number generator returns a number between zero and 
one from a uniform distribution. This can be used to get 
numbers from other distributions in the following manner. 
First the cumulative distribution of the given distribution 
is found, which will have probabilities varying between zero 
and one. The random number generator can be used to select 
one of these values of probability. This value is substituted 
into the cumulative distribution which has been solved for 
the random variable. In the drum simulation numbers from 
exponential distributions were needed. Such exponentially 
distributed random variables can be obtained in the following 
manner* Suppose we want to select a random number f»om the 
exponential distribution of interarrival times, which has 



been shown to be 
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P(t) = ae" at t? > (1) 



Denoting the cumulative distribution by Q(t), ire have 

Q(t) * / a*-** dt = 1 - e" at (2) 


Solving for t, 

t - - J in (1 - Q{t)) (3) 

But in Q(t) all probabilities in the interval (0,1) occur 
uniformly, so we can use the random number generator to 
select a probabilty Q{t); substitution into (3) yields 
the desired exponentially distributed random variable, t. 
Equation (3) was used in the Process Model to select 
waiting times til the next request, and to select the 
number of pages in a segment. 

The following data were taken during a typical 
simulation, for each queue discipline: 

(1) per cent process idle time; 

(2) waiting time in the queue; 

(3) number in the queue; 
(If) service times; 

(5) access times; 

(6) channel idle times; 

(7) number of fields used per sector on the drum. 
The following set of parameters was considered typical. 
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Fractional drum occupancy #90 

Number of processes 20 , 

Mean Inter-request time 15. msec. 

Mean number pages per segment 10. 

Read-write ratio 3. 

Number of drums 3 . 

Number of channels each drum 3* 

Number of fields each drum 256. 

Number of sectors on drum 64. 

Number of words per page 64. 

Drum revolution time 16.7 msec. 

The following per cents of process idle time were found for 
each queue disciplines 

First come first served 55* 

Shortest Job first *&% 

Shortest Access time first M# 

Other simulations using modified sets of parameters 

(for example, two drums with two channels e*ch; or longer 

service times, that is, more pages per segment) showed the 

same result — the shortest acoess time queue discipline results 

in minimum idle time. This point has been discussed under 

our comparison of queues in Section 3.3.5. 

Probability distributions of all data were taken. 

Three of them were of particular interest, and are reproduced 

hare. These were the waiting time in queue, the number in 

queue, and the number of fields used per sector per drum. 

These are plotted in Figures 5.1 f 5.2, and 5.3 for eaoh 

queue discipline. The means and the maximum points are 



;:-- _..-fe^s^rf; "■■-— ;-../-;?--' -.,:■ :;>■-■ r> i ~.*t^.-x&$^j&t&*&l>t*> l '" " ' -*tS^^-*TM«ff**-K ^ 
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Indicated. It la notable that the mean wait for First Come 
First Served was 17.1 msec, while for Shortest Aooess Time 
First and Shortest Job First It was significantly less, 
6.8 msec for Shortest Job First and 6.3 msec for Shortest 
Access Time First. Again the Shortest Access Time Queue 
lead to the minimum wait. It Is also of significance that the 
shape of the waiting time In queue dlstrltutlon Is exponential 
as predicted by Appendix 2. The number In the queue (Figure 5.2) 
was about 8 for First Come First Served, and half that for the 
other two queue disciplines. The number of fields per 
sector per drum (Figure 5.3), Is not dependent on the queue, 
but Is dependent only on the deletion policy, which Is shown 
in Figure 3.12. It Is Interesting to note that It is 
normally distributed, and that at desired occupancy level of 
90* the maximum data point was 2^2 out of 256 fields used (95*). 
The mean was 230 fields used (90*). Tnis was for a sample 
of 2^,500 points. We conclude that oooupanoy levels In excess 
of 90* oan be maintained without overflow. 

The remaining three distributions are not plotted here, 
but we will discuss each briefly. The service distribution 
was found to have approximately the same shape as the 
number of pages per segment distribution, but It was distorted 
due to the inclusion of the aooess: time In the servioe time. 
The mean service time was found to be the sum of the mean 
access time and the mean transfer time, as expected. 

The aocess time distribution was uniform for First 
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Come First Served, with a mean of 16.7/2 msec « 8.3**- msec, as 
expected. For Shortest Access Tine First this distribution 
was found to follow closely the predictions of Section <t.2. 
The access distribution for Shortest Job First was somewhere 
between the First Come First Served and Shortest Access Time 
distributions, as expected. 

Finally the channel idle time distribution showed 
that there was an insignificant amount of channel idle time* 

Let us mention what the maximum waits in the queue 

were. First Come First Served had the smallest maximum 

wait, as expected, and Shortest Job First had the largest. 

Some numbers are, for the typical parameters listed on page 68, 

First Come First Served. ......... 62. msec 

Shortest Acoess Time 65. msec 

Shortest Job First 100. msec 

Note that the Shortest Access Time does not cause waits 

too much longer than the First Come First Served Queue. 

Other simulations were run, in which the Shortest Job 

First queue was observed to have a maximum wait of b sec, 

for parameters not too different from the ones listed on 

page 68. 

A last point: queues in which the skip limit* 

was used h*»v* a "First Come First Served" component, and 

are accordingly less efficient than a Shortest Acoess Time 

queue. A skip limit of ten in a Shortest Access Time queue 

caused its efficiency to be only slightly greater than 

♦Section 3.3.*K 
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Figure 5.4. Probability density of number of 
fields used per sector. 



a First Come First Served queue. 

5.3. Shortest Access Time Queue Simulation . 

This simulation was composed of two elements, one 
to make requests, and the queue. With Section 4.3» the 
arrival rate of requests at equilibrium is 

(4) 



where 



(1 + r)j 



*»&<irTT + i> <5> 



and T « drum revolution time, 

A = inter-request times per working process, 

c * number of channels, 

s - mean number of pages per segment, 

m = number of sectors per drum, 

n s mean number in the queue. 

The simulation was seeking to test equations (1?) for 

Seotion 4.3^ which are 

n«~ -S g (6) 



1 + *Rn\% 



H « (n - t) B A o (7) 

Four single-channel (c = 1) simulations are considered here. 
The parameters were: 

Parameter 12 14 

set number ~"^ J 

T 16.7 msec 16.7 msec 16.7 msec 16.7 msec 

A 15.0 msec 5*0 msec 0.5 msec 1.0 msec 

s 4.0 8.0 40.0 2.0 

m 64.0 64.0 64,0 32.0 

N 20.0 10.0 25.0 5*0 
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The results, were, for simulation samples of about 3000 data 
points, as follows. 

n W 

Parameter q 

set number predicted simulated predated sflimflafrefl 

1 12.85 12.97 27.78 msec 26.91 msec 

2 7.96 7.99 29.48 " 29.01 " 

3 24,00 23.68 260.98 " 259.12 H 

4 3.98 4.00 15.30 » 16.54 " 



It Is apparent that the agreement Is good. 

One last point 1 In Section 4.3 it was mentioned that 
If the drum position Is not random, that is, when short 
segments were used, then the access times should decrease, 
and In particular the number In the queue and the waiting 
times should decrease. The following simulation verified thlsi 

Parameters : 



T «= 16.7 msec 






A ■ 7.0 msec 






s - 3.3 






m « 64.0 






Results : 

n 


Random 


Function of time 


3.86 


17.11 msec 


2.48 11.40 msec 



There Is a significant difference, and fortunately the 

errors are in favor of much increased operational efficiency. 

Prom Section 4.3 the efficiency is 

N - n 
N{1 + t) 

Efficiency t Random Function of time 

m 32* * 46* 



^¥Spr)V2TfI -vr \^rf^rrr.'r~^rr^^ 
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5.4. Conclusions . 

In this paper we have shown that for a segmented 
mult i programed, multiprocessor computing system, the 
following Is true i proper maintenance of auxiliary memory 
can greatly Improve system efficiency. We have shown how 
this can be done. In particular! 

(1) It Is generally possible to store pages consecu- 
tively on the drum, and proper deletion policy 
can be used to maintain oooupanoy levels In 
exoess of 90£. 

(2) The Shortest Access Time queue discipline is 
the most eff lolent queue for an auxiliary 
memory, where time is spent waiting for mechanical 
parts to move into some proper position. 

If request sizes are large, that is if segments 
oontaln many pages, then it is not diffioult 
to derive equations for the average number in 
queue, and for the average wait in the queue. 
If the segments are short, these equations break 
down, but provide an upper limit for the average 
number in queue and the average wait in queue t 
The error is in favor of increased efficiency, 
(4) A reasonable probabilistic model for the processes 
in a segmented computing system has been given 
in this paper. 
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(5) Simulation is a particularly useful tool for 
analyzing problems of the complexity of oomputing 
systems, for it is frequently helpful in providing 
a starting point for analysis. 

(6) Hixed-Polioy queues may be used in drum (or 
disc) auxiliary memory systems when we beoome 
concerned that some requests might hive to wait 
inordinately long. A "skip Halt* aueue was found 
to be more efficient than a "window" queue (see 
Appendix 2). 

5.5. Suggestions for future study . 

(1) ?2*£. deletion policy of the Processes model . 
Although it is possible to prevent drum overflow, 
and to maintain 90J< occupancy, exactly what deletion 
policy is the best, if any? See chapter 2 and 
Section 3.1* for discussion. 

(2) The "page-turning 11 vs. "segment-turning 11 allocation 
problem of Chapter 2 should be considered in 
detail. 

(3) The finite population, shortest Job type of 
queue is yet to be completely analyzed. 
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APPENDIX 1. THE POLLACZEK-KHINTCH INE FORMULA . 

In this Apn end It we will derive an equation which 
Saaty {16) refers to as the Pollaozek^Khintchine Formula 
(Samty, pp 40-43), Saaty has derived it for the poisson 
input, single channel, equilibrium queue. We will extend the 
reasoning to include the c-channel server. Since we talk 
only of the number in the system, the queue discipline is 
irrelevant to our discussion, until we begin to talk of 
waiting timet. 

Suppose that arrivals oocur at random according to 
a poisson prooess at a rate a per unit time, to a waiting 
line In statistical equilibrium, before a o-channel facility. 
They are served according to some arbitrary servioe-tlme 
distribution at a rate b per unit time per channel. We assume 
that if the service rate of one channel Is b per unit time, 
then it is be per unit time for all c channels operating 
together. Suppose that a departing request leaves q in the 
system behind, including those in service, and that some time 
t will elapse before the next departure. Let the waiting 
line increase in length by k requests during this one 
service Interval. If the next departing request leaves q 1 
behind in the system, we can relate q and q* as follows : 

q f *= max (q-l t 0)+k=q-l+d+k (l) 

where d(q) = if q > 
d(q) - 1 if q • 

By introducing d(q) we eliminate the max expression. 
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We assume that equilibrium values for the first and 

o 
second moments E(q) and E(q ) of the number in the system 

exist, where we are treating q as a random variable. We note 

2 2 
that E(q) ■ E(q*) and E(q ) * E(q ( ) since both q and q' are 

assumed to have the same equilibrium distribution* We observe 

that since equilibrium, eatoh departing request must leave 

behind identical time* Independent queues, each having the 



same probability distribution. Now, from the definition, 

2 
d := d, and 

(1) we have 



d :b d, and q(l - d) « q. Thus, taking the expected value of 



E(q*) * E(q) * E(l) + E{d) + E(k) (2) 

but since E(q) = E(q f ) we have 

E(d) » 1 - E(k) (3) 

During an lnter-departure Interval of length t we have 

oo k 
E(k) = £ k^- e" at « at (if) 
k*0 K# 

E(k 2 ) 5 Ek 2 %^ e" at « (at) 2 «■ at 

k«0 *• (5) 

Let us denote the combined servioe distribution for all 
c channels operating In parallel by S(t). Taking the 
expectation of E(k) with respect to this service time 
distribution we see that 



E(k) « / (at) S(t) dt 


oo 

«= a/ t S(t) dt (6) 





E(k) - ft - r 
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since the mean of S(t) is 1/bo. But since se have not 

specified S(t), E(k) ■ r is unaffected by the type of service 

distribution. Then 

E(d) = 1 - r (7) 

Now, if the probability of the queue increasing by k is 

independent of the length of queue, q f and of d f which 

depends only in q, any expectation over products of r, q, 

and d is Just the product of the respective expected values. 

Therefore 

E(k 2 ) * / ((at) 2 + at) S(t) dt 


which is an average over all time. But 

E(k 2 ) « / (at) 2 S(t) dt ♦ Mat) S(t) dt 


2 "5 

« bT t~ ♦ at 
s s 



But the variance of S(t) t o 2 , is 



8 

a 2 m ~2 T Z 

8 S S 



Therefore 



Finally, 



E(k 2 ) » a 2 (a I + t 2 ) + at 



s 



J2\ _ -2 2 A 2 



E(k*) = a^a* + r* + r r . £ (8) 
If we square both sides of equation (l)i 

q* 2 » (q - I) 2 + 2(q - l)(d * k) ♦ (d + k) 2 

q» 2 - q 2 - 2q(l - k) + (k - l) 2 + d(2k - 1) (9) 
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p 
Equation (9) was obtained by using qd « 0, and d = d. 

Because of equilibrium, 

E(q 2 ) - E(q' 2 ) = = 2E(q)E(k - 1) + E((k - l) 2 ) 

+E(d)E(2k - 1) (10) 

Recall that the validity of (10) depends on the independence 

of q and k. Solving for E(q) and using equations (6), (7), 

and (8), we have the Pollaczek-Khintchine Formula : 

i2. 



E(q) = «((*-lg (l »«flW2»-D 



a 2 c 2 + r 2 ♦ r - 2r ♦ 1 + (1 - r)r - (1 - r) 
a 2 - 2r 

* 2 + * 2 °? 

E <"> - r + 2(1 - >)' r * £ (11 > 

Thus, once we know the variance of the service time distribution, 
the average number in the system, E(q) is determined. It 
is important to note that E(q) is an average taken over 
instants Just following departures, and is not the time 
average. If E t (q) is the time average, all we can say 
without further argument is that 

E(q) < E t (q) < E(q) + 1 

In general the average number in the service system equals 
the sum of the average number of busy channels (here it is 
r = c— ) plus the average number in line. 

To obtain the average wait in the waiting line, which 
we will denote by E(w) , we observe that a(E(w) + ^ ) is the 
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expected number of arrivals during the expected time of one 
request in the service system, if the queue discipline is 
first come first served, because ^- is the mean service time. 
But this must be Just the number in the system immediately 
after a customer departs, that is, E(q), so 

r 2 ♦ « 2 o 2 
" M + ft " r + 2(1 - r) 

hat T " ft. so r 2 + a 2 o 2 

\ " E(w) = Vl'-rl tl2) 

We have pointed out that r is just the number of busy channels 
and that E(q) is the expected number in the system. Inspec- 
tion of (11) will show that the number in line, L . must be 

q' 

r 2 + a 2 c s 2 
L q = 2(1 - r) 
We have the interesting and important result 

\ - 4f- <«> 

Notice that this is exact only if the number in the system, 
E(q), is independent of the service time or the arrival 
rate, as pointed out after equation (10). We also note that 
if (W + ^) is the time of one customer in the service 
system, then bc(W + ~ ) is one more than the number in 
the system, E(q). This is so because if E(q) are in the 
system, then E(q) - 1 service intervals pass while one request 
is in the system. Therefore bcW = E(q) and we have the 



second result 



8* 



w s Mai (li0 

q DC 



In words* 

u _ average number In the line 
q average arrival rate 

_ average number In the system 
* average servloe rate 

These are true for arbitrary service distributions. 

It Is Interesting to note that If the service times 

are exponential, that is, the service follows a poisson 

law, then the Interval between departures is given by 

S(t) « boe'^ t > 
It Is a well-taiow fact that for this type of distribution 
the variance equals the mean squared, that is, 



a I « / t 2 boe- bot dt - trtbc e" bct dtj 2 
z 

= (1/bc) 2 

Substitution of this Into (11) yields 

S + *-*'♦ T^r <"> 

Prom which It follows that 

L = r ^- (16) 

q 1 - r 

and E(q) « y^ U?) 

Consider for a moment the geometric distribution 

P(k) » r k (l - r) (18) 
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It is known that this distribution describes the number 
in a service system with exponential input and output 
(Saaty, 17, pp 38ff) # The expected number in the system is 

L = £ k r k (1 - r) 

k » 

= (1 - r)r A £ r * 
ar k«l 



~ 1 - r 
which is the same as (17). Then we can find the variance 
of (18) which is 



£ k 2 r k (1 - r) - I? = (1 - r)r -*- r -£; L r k - L : 
k=0 dr dr k=0 



2 



L " (1 - r) (1 - r) 2 

o L 2 = L{L + 1) (19) 

We have the result that for the exponential input, exponential 
output system the number in the system is given by (17), 
the number in queue by (16), and the variance of the number 
in the system by (19). The results of this section will hold 
for queues in which the discipline is random as well as for 
first come first served. They hold for random disciplines 
because, on the average, the number of service intervals 
that must pass before service is the same as for first 
come first served. This is seen in Section J^.3. In fact 
the equation for the mean number in the queue, L is accurate 
if the following conditions are satisfied! 
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(1) all requests stay in the queue until served; 

(2) the service time distribution for all channels 
is the same, with parameter b; 

(3) channels serve one at a time} 

(*0 a channel serves the next request, if any are 
are waiting in the queue, as soon as it finishes 
with the last request, 
A little thought will show that if these four rules hold, 
the length of the queue is the same for all disciplines, 
although the mean wait, W , will vary, (Morse, 13 , P- H?) • 
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APPENDIX 2. WAITING TIME IN A SHORTEST ACCESS QUEUE . 

In this appendix the probability density funotion for 
the waiting time in a shortest acess time queue is derived. 
We define the following random variables: 

A « r.v. of access time, taking on values a. 

N • r.v. of number of requests to exit the queue 

before a given request exits, taking on values n. 

P « r.v. of number of pages per segment, taking on 
values p. 

R « r.v. of number of requests in the queue, taking 
on values r. 

T * r.v. of transfer time, taking on values t. 

W •= r.v. of waiting time in queue, taking on values w. 

Since at each trial (request-granting time) all requests 
are assumed to be equally likely to exit next (Section 4.2) 
the distribution of N is geometric. As on page 58, 
equation (1), the conditional distribution of N given that 
B are in the queue is 

P N/H (n/r) = (1 - J) n " 1 (J) (1) 

where R is the random variable of the number of requests 
in the queue. Denoting the density function of R as E fl (r) : 

P NR (n,r) - r M/R (n/r) »„<r) (2) 

We are interested in the wait in queue, so we have defined 
the random variable of wait to be W. Then 

Pyjjjjtw.n.r) * p v / W w/n,r * P N/R tn/rJ P R <r * 

(3) 



88 



Por a single channel queue the wait in the queue is N access 
times plus N transfer times. As in Section 3.2 we assume the 
number of pages per segment to be a random variable, P, where 

Pp(p) - o 2 p e" op W 

and o is a constant proportional to the mean number of pages 
per segment. If T* is the drum revolution time and S 
the number of seotors around the drum, then the transfer time 
for one pages is T'/S. Denoting the random variable of transfer 
time by T, we have for the density function of Tr 

P T (t) = k 2 t e" kt (5a) 

with the constant k defined as 

fc-r 5 - (5b) 

and jf fl is the average number of pages per segment. The wait 
in the queue is, from above 

W = N (A + T) ■ NA + NT « y ♦ z 

with y « KA and z * NT. 

Prom Section k,2 f the cumulative distribution of the 
aocess time is 

<A < a ] - 1 - (1 - a Q ) N 

But y = NA. Then 

P[y < a] « P[A< f] . 1 - (1 - §) N 

-1 -[(1 -{|r N /*]-* (6). 
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Now let u = -a/N. Then 

p[y < a] * 1-LU + u) 1/u ]- a (7) 

For large N, u approaches zero and we know 

u ^ [1 + U] 1 '" =e (8) 

Thus for large N^* 

p[y < a] « 1 - e' a 

and the density function for the access time component 
of the wait in queue is 

P y (a) - ^ P[y < a] « e" a (9) 

Using an elementary probability transformation, the density 
function for z = NT is 

V« - » V I > - £ » •■ <M,,/ * 

Defining ^ = k/K = S/T'NiJ s we have 

P z (b) =K n 2 be- K « b (10) 

Since A and T are independent random variables, the conditional 
density function for W, given N and R is 

P W/KH< w) " iV"- 1 ' F z U) dX 

the convolution of E(a) and P (b) . This evaluates to be 

VttH< w > • £_ 1)2 e ' W (12) 



This approximation is surprisingly good for N > 10. 
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Recalling equation (3), 

P WNR (w ' n ' r) ' P WAR< w/n ' r > P N/R (n/r) P R (r) 
Putting (1) and (12) Into (3), 

lK n -lJ 

WNR ((k/n)-l) 2 r r R 
If N is large, as it is assumed to be, then P«(r) Is 
approximately Normal by the Central Limit Theorem, and 



1018 '' ' ((k/n) 



ilj e - w (1^,-1(1, ^_ e -(r-R) 2 /2^ 

-i) 2 r r JSfw T 



(13) 

which is the required joint density function of waiting time 

in the queue. The simulation has shown that for the mean queue 
length, R, greater than about 10 with tr «I (which is the oase 
when R > 10) this approximation is valid. Thus in the 
steady state situation, it is clear that the probability 
density for waiting time in the queue is approximately 
exponential, a fact verified by simulation (Section 5.2). 
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APPENDIX £. DESCRIPTION OP A MIXED - POLICY QUEUE . 

The queue described in this section has been proposed 
as a shortest access time queue, but one for which we are 
concerned that a particular request may be continually over- 
looked due to the random nature of selection. Consider for 
example a queue which has many requests in it (at least thirty), 
Such a queue might occur if it were decided to request 
pages singly instead of in segments. In Section ^.3 it 
is shown that the waiting time of a request until it leaves 
a random output queue is given by a geometric distribution, 
with the expected wait equal to n service intervals, where 
n is the average number in the queue. Now if n is large, 
then it is conceivable that a request might have to wait for 
a very long time: the .variance of the geometric distribution 
is <n)(N - 1) » n 2 if n is large. 

Consider the queue shown in Figure A3.1. A new request 
is always added to the bottom of the stack. A section of 
the stack, of length N, is considered, the remaining requests 
in the queue being ignored for the while. We shall refer to 
the portion of the queue under consideration as being viewed 
through a window , of size N. The top of the window Is 
always at the top of the stack. The requests in the window 
are labelled R,,R 2 ,,..,R N , and are considered according to 
the shortest access time first queue discipline. Whenever 
the request marked R, is removed, the window is moved down 
until its tdp coincides with the next request R, . It 
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Next In 




S. WINDOW 



Fl poire A3.1 . Structure of a Mixed-Policy Queue, 
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appears that R, might have to wait until the N service 
time before it leaves, but no longer (by then it would be 
the only reauest in the window); thus it seems that an upper 
bound can be placed on the waiting time in the window, 
namely (N - 1) service intervals. But this is not so. 
Consider the request marked B N . Suppose by some quirk of 
fate that requests are serviced as follows. R, ,R 2 ,..., 

R N-1 ,R N+1» , **» R 2N» R N' Thls would happen if r i»*"» r n_i 
were serviced before R^ but then the window has become 

positioned at Rj,, and the next (N - 1) requests could be 

serviced before Rj,. It Is clear that the maximrun wait in the 

window is 2(N - 1). Since the arrival rate is given by 

an average 9 the expected wait before reaching the window is 

M; an upper bound to the wait is M + 2(N - 1) service intervals. 

We are assuming M > N. 

To find a lower bound on the waiting time, consider 

the following argument. Suppose a request enters the queue 

,1ust before the window makes a jump of N, then suppose the 

window moves one position at the end of each service interval. 

The request in question would then wait only (M - N) service 

intervals to reach the window. Then suppose it were let 

out immediately. The minimum wait is therefore M - (N-l) 

■ (M - N + 1). We have set an upper limit on the waiting time: 

W ™« T s (« + 2(N - 1)) t- (1) 

max s 



9^ 



and the lower limit of waiting time is 

w »ln= (M - < N - 1)) *s (2) 

Equations (1) and (2) assume that M > N. 

On the average the window is not full* We can think 
of the Droblem as a flow problem, with requests flowing into 
the bottom of the wondow at the rate of one per service 
interval, and filtering out through the window at the same 
rate. Let us imagine one of the requests being tagged so 
that we can keep track of it. If we know on the average how 
far down the window a request moves before it exits then 
we know the mean wait in the window. Simulation of the 
problem for several window sizes was carried out, and it 
was found that on the average the tagged request went 
half way down the window before exiting. Then we can write 
w av - (M * f) t. (3) 

Figure A3. 2 shows the probability densities of a request 
being at various positions in the window. It is to be noticed 
that the tagged request spends considerably more time at the 
upper and lower ends of the window then at the center. 

The standard deviation was found to be 0.8 of the mean, so 

N 
the contention that the request is a « on the average is 

not too certain. This implies that the probabilities of 

W and W . are not small. Figure A3. 3 shows the probability 
max min ^ 

density of window ,1umps. The average window Jump is about 

S. Figure A3.it- shows the following: the mean position reached 



95 



by the tagged request, and the mean window movement when it 
moves, both as a function of window size. 

Recall that for efficient access time queueing the 
mean in the queue had to be at least eight. Hence we would 
require that the window length be N > 16. But since M > N, 
the overall queue would have to have an expected length M + N > 32. 

It appears that the use of the minimum access time 
queue without the window, but with the "skip limit 1 * mentioned 
in Section 3.3.5 is better for the following reason. The 
skip limit could be set to an upper limit of 2 (N - 1) so that 
the maximum wait for that queue would be the same as given 
by equation (1), but with M ~ 0. Since the "skip limit* 
queue with the same maximum is longer than the corresponding 
"window" queue, the access time is shorter, and more efficient 
queueing is had. 
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APP BHDIX k, A CONTINUOUS - TIME MARKOV MODEL. 

With Howard (11, pp. 92ff) we define a rate Matrix [a], 

having elements a^,. The rate matrix is similar to the 

familiar Markov transition probability matrix except that the 

elements a ± represent transition rates from the i tn to the 

J state. The rates are assumed to be taken from exponential 

distributions. A transition matrix, then, is a discrete form 

of a rate matrix. Since we consider an equilibrium system 

the overall rate of change must be zero. Define a state 

probability vector P, where P = [p^Pg, . . . ,p M ] and p. is the 

prbbability that the system has i requests in it. Because 

of equilibrium, 

[P][A] = (1) 

We make the following assumptions. 

(1) All requests join the queue and do not leave 
until service is complete. 

(2) Each channel serves one request at a time, and 
does not begin the next request until the present 
request is finished. 

(3) As soon as a channel becomes idle, the next request 
enters service, provided there are some in the queue. 

W The queue discipline is first come first served, 
or else random. For any other queue discipline 
tint satisfies (1) through (3) the expressions 
for state probabilities and average number in 
line are the same, but the waiting time in the 
queue is not the same. See closing remarks of 
Appendix 1. 

We use the following notation: 

M =s the size of the finite number number in the 
total population being considered — it is the 
sum of the number in the service system dIus the 
number making reauests. 



T-"— * — ■ "— "■' 
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a ■ mean request rate per requestor, where 1/a * 
mean interarrlval Interval per requestor. 

b s mean service rate per channel, where 1/b * t g , 
the mean service time, 

c * the number of parallel channels providing service. 

p * the probability of the service system having n 
n of the M possible requestors in it. 

If the system in in state n (indicating that n requests are 

in the service system, and that (M - n) are remaining outside 

in the requesting population) then the rate of exit to the 

state (n+1) is nb f or n < a and is cb for n>o. We have the 

rate matrix 



[A] 



-Ma Ma 

b -b-(M-l)a (M-l)a 
2b -2b-<M-2)a (M-2)a 



cb 







-th 



At the c row the matrix is 



-cb-2a 
cb 

e 



2a 
-cb-a a 
cb -cb 



(c-l)b -(c-l)b-(M-c+l)a (M-c+l)a 

cb -ob-(M-c)a (M-c)a 

o cb -ob-£M-c-l)a .. 
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Because of equation (1) we can write 
-Map Q + hp 1 = 
Map Q - bp x - (M-Uap^^ + 2bp 2 = 

and in general 

<M-n+l)ap n _ 1 - nbp n - (M-n)ap n + ^n+l)bp n+1 = l<n<c 
(H-n+l)ap nwl - cbp n - (M«*n)ap n ♦ cbp n+ , = c<n<M 

Adding the n and the (n-l) equations, which is equivalent 
to adding adjacent columns in [a] , we have by recursion 

P 1 = Mrp Q 



so that 



p = M=l _ M(M-1> 2 
p 2 2 rp l ~ 2 r p o 



p n n» r p c 



p n - n.'{£-n)! ^o °^ n<c 

(2) 

P« = ~ttv f ~\ , r n -— — p c< n<M 
*n c!(M-n); „n-c *o 



a 



where r =^j . p Q is found from the requirement that 

M 

2 P n - 1 (3) 

n=0 n 



n f nTnSn7T r + n f c cTTSnTT r 7^ 



' ' ! J gg^?-;-^ ' " ! '^''g £ "' 
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If h Is the average number of processes actually In operation, 
k the number being serviced, and L the number In line, then 

k * h ♦ L « M (5) 

a* 

and because of equilibrium 

ft -t-* (6) 

The number being serviced Is 

c-1 M e-1 
k * E np + o S p = o - £ (c-n) p (7) 
n*0 n n«c n n=0 n 

The number In line Is 

M 

L - E (n-o) p„ (8) 

q n=0 n 

and as usual the waiting time In the line Is 

W -5l (9) 

q a 

The number In the system Is 

M 
L * L + k - £ n p M (10) 

* n«0 n 



The efficiency Is 



M - L - k 

number of working processes m Q 

M M 

H 
M - L n p w 
n«0 n 



The summations can be evaluated on a computer without too 
much difficulty if the factorials are expressed as logarithms, 

and use Is made of the fact that 

n 

n! ■ exp [ £ln (1)J 
1-1 



■ ■; ,- ^jfi^:.^,^ v.^,--^,-^; .;-••,.- j*W*-:. v>. . -*- 
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It is Interesting that a closed form for (10) can be 
obtained when there is one channel, i.e., when c=l. In this 
case equations (2) become 

Pn-T&^o "=0,1,..., M (11) 

and (4) becomes 

p , 1 r * £ (12) 

? M! „n 

Then L is the number in the system, and L-r»L is the number 
in the line. 



L = P I A,^ (13) 

n»0 lM ~ nK 



Consider 

? (M-n)MS ji _ 5 
p o n=0 tM " nJ * n«0 

expanding (14) we find 



" - L *»» * - A idKm * <»> 



£=£ « M + M(M-l)r + M(M-l)(M-2)r z + ... (15) 

p o 

But 

-^- - 1 + Mr + M(M-l)r 2 + M(M-l)(M-2|r 3 4- ... (16) 
p o 

Comparison of (15) and (16) reveals that 

Hdfc . (i - i ) 1 (1?) 

Solving for L f 1 - p 

L « M - — — * (18) 
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All that is needed to find L Is an evaluation of p . not 

an evaluation of each p n as well. The number in the queue is 

L q = L-r = M-r j-2 (19) 

So that the waiting time in queue is 

W = -SL « _ _ £ 12 

q a a a ar 

y s < M - ^^ » - s < 2 °> 

where t = ^, the mean service time. 

The interested reader is referred to A.L. Scherr's 
Doctoral Thesis, in which it is shown that Multiprocessor 
time-shared computing systems are in general, accurately 
described by Markov Models. 
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